ON CONTINUITY EQUATIONS IN INFINITE DIMENSIONS 
WITH NON-GAUSSIAN REFERENCE MEASURE 



ALEXANDER V. KOLESNIKOV AND MICHAEL ROCKNER 

Abstract. Let 7 be a Gaussian measure on a locally convex space and H 
be the corresponding Cameron-Martin space. It has been recently shown by 
L. Ambrosio and A. Figalli that the linear first-order PDE 

p + div 7 (p-6) = 0, p\ t=0 = p , 

where po ' 7 is a probability measure, admits a weak solution, in particular, 
under the following assumptions: 

\\b\\n 6 L p (7), V > 1, e x p(e(div 7 f>)_) 6 L 1 ^). 
Applying transportation of measures via triangular maps we prove a similar 
result for a large class of non-Gaussian probability measures u on M°°, under 
the main assumption that ft £ n rl gp}L n (i') for every i £ N, where ft is the 
logarithmic derivative of v along the coordinate Xi. We also show uniqueness 
of the solution for a wide class of measures. This class includes uniformly 
log-concave Gibbs measures and certain product measures. 



1. Introduction 
In this paper we study infinite-dimensional continuity equations 

(1) /i + div(/i) = 0, fi = C) 

where p = p t {dx), t > 0, is a curve of probability measures on R°° equipped 
with the product er-algebra induced by the Borel cr-algebra on K and b : IR 00 — > 
R°°. Furthermore, p = -j^p, div is meant in the sense of distributions and £ is a 
probability measure on R°° serving as the initial datum. One approach to solve 
equation ([TJ is to choose a reference measure £ and search for solutions for |T]) with 
C = po ■ v which are of the form pt{dx) — p{t,x) ■ v(dx). When (JT]) can be written 
as 

(2) p + div„0 ■ b) = 0, p(0, x) = po, 

where div„ is the divergence with respect to i.e. (—1) times the adjoint of the 
gradient operator on L 2 (IR 00 , v) . We stress that the choice of the reference measure 
(even in the finite-dimensional case, where M°° is replaced by R d ) is at our disposal 
and should be made depending on b. For instance, in the finite-dimensional case 
b might be in a weighted Sobolev class with respect to some measure v absolutely 
continuous with respect to Lebesgue measure, but not weakly differentiable with 
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respect to Lebesgue measure itself. Then one should take v to be the measure 
for which the components of b are in W 1,2 (i/). So, singularities of b will thus be 
compensated by the zeros of the Lebesgue density of v. 

Likewise in the infinite dimensional case of R°° , where one usually takes a Gauss- 
ian measure as reference measure, since they are best studied. However, in many 
cases this is not the best choice, for similar reasons as we have just seen, in the 
case of R d . For instance, there are interesting examples (presented in Section 7.2 
below), where the reference measure should be taken to be a Gibbs measure, whose 
energy functional can be " read off" the given map b which determines equation (jTJ) , 
respectively ([2]). 

The key point, that for such reference measures we can identify conditions so 
that @ has a solution and/or that this solution is unique, lies in the fact that 
many probability measures on R d are images of Gaussian measures under so-called 
triangular mappings which turn out to have sufficient regularity in many concrete 
situations. Therefore, we can reduce existence and uniqueness questions {T]), re- 
spectively ((2]) to the case of a Gaussian reference measure, studied in [7] and [20] . 

To explain this and also to review a bit the history of the problem, let us return 
to equation ([2|) and recall that the associated Lagrangian flow has the form 

(3) X(t,x) = b(X), X(0,x) = x. 

A finite-dimensional theory of equations ([T]) and ((2]) for weakly diffcrentiablc drifts 
b has been deeply developed in a recent series of papers by L. Ambrosio. G. Crippa, 
C. De Lellis, G. Savare, A. Figalli and others (see [5] and the references therein). 
This theory works under quite general assumptions and includes, in particular, 
existence and uniqueness results for BV (bounded variation) vector fields. 

Relatively little is known, however, in the infinite-dimensional setting. The first 
results in this direction have been obtained by A.B. Cruzeiro QI5], V.I. Bogachcv 
and E Mayer- Wolf [15]. The starting point for us was the paper [7], where some 
finite-dimensional techniques (including the Di Perna-Lyons theory of renormalizcd 
solutions) have been generalized to the infinite-dimensional Gaussian case. Other 
recent developments can be found in Ambrosio-Figalli 6 , Le Bris-Lions [28] , Fang- 
Luo [20] , Bogachev-Da Prato-Shaposhnikov-Rockner [16] . 

We stress that the uniqueness of the solution is a more difficult problem compared 
to the existence. The latter can be established under quite broad assumptions (see, 
for instance, [16] for the apparently most general results about existence). The 
uniqueness proof obtained in [7] relies very strongly on the Gaussian framework. 
An important technical point was smoothing by the Ornstein-Uhlenbeck semigroup 
which behaves very nicely with respect to many natural operations on the Wiener 
space (divergence, projections, conditional expectations, differentiation etc.). The 
absence of such a nice smoothing operator seems to be the main difficulty when 
one tries to solve ([2]) for non-Gaussian reference measures. 

In this paper we prove an existence result for the case of reference measures v 
on R°° with logarithmic derivatives integrable in any power. We also show unique- 
ness for a wide class of product measures, including log-concave ones. Another 
uniqueness result is proved for a class of uniformly log-concave Gibbs measures. 

Our approach relies on the mass transportation method. The general scheme 
works as follows. Instead of directly solving ([2]) we consider a mass transportation 
mapping T : R°° — > R°° pushing forward the standard Gaussian measure 7 onto 
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v: v = 7 ° T 1 . If Vi = pt ■ v is the solution to @ , then the family of measures 
7t = Vt o S^ 1 with S = T^ 1 solves the continuity equation for the new vector field 

(4) c = DT^ 1 ■ b(T), 

here D denotes total derivative. Applying (slightly generalized) existence and 
uniqueness results for the Gaussian case from [7J , we get a solution "f t of the equation 
associated to the vector field c and transfer it back, i.e. vt — It ° T^ 1 . 

The main advantage of this approach is that the divergence operator commutes 
with T: 

div 7 c = [divj/b] ° T, 

Hence the crucial assumptions on div 7 c can be directly transferred to div^fr. On the 
other hand, assumptions on integral norms of c and Dc impose some restrictions 
on Sobolev norms of T and S = T^ 1 . To prove the corresponding a-priori bounds 
is the main technical difficulty of our approach. 

Note that we are free to choose any type of transportation mappings provided 
they have sufficient regularity. In this paper we deal with triangular mass trans- 
portation. A short discussion about the optimal transportation approach can be 
found in the very last section of this paper. The advantage of these mappings is 
their simple form. Even in the infinite-dimensional case they have essentially finite- 
dimensional structure. We obtain some Sobolev estimates on S and deduce from 
them the existence result for The key estimate for triangular mappings applied 
in this paper looks as follows. Let S = J2i &i ' e » be the triangular mapping pushing 
forward the measure v onto the standard Gaussian measure 7. Then 




Here /3j is the logarithmic derivative of v along Xj . For more details on triangular 
mappings see [S]. 

The paper is organized as follows. In Section 2 we prove an extension of the 
results from [7j. In particular, we weaken some assumptions in [Jj by introducing 
a slightly weaker notion of solution (see Remark 12. 1|) . In Section 3 we establish 
Sobolev estimates for triangular mappings. In Section 4 we prove the key technical 
relations between transport equations and mass transfer. The existence result is 
proved in Sections 5. Sections 6-7 deal with the uniqueness in the product and 
Gibbsian case. In particular, we prove a uniqueness result for log-concave Gibbs 
measures with the following formal Hamiltonian 

00 00 

i—l i^—X 

In Section 8 we briefly discuss the approach via optimal transportation mappings 
and the finite-dimensional case. In particular, we prove an existence and uniqueness 
theorem for a broad class of log-concave measures under " dimension- free" assump- 
tions. Furthermore, in Example 18.51 we give an example in the finite-dimensional 
case, for which our result (see Theorem 18. 4p implies existence and uniqueness for 
©, where b : R°° -» IR 00 is not BV (hence the results of [7], @] are not applicable). 

Notations: Throughout the paper p* is the dual numbers to p £ [1, oo[: ^ + ^r = 
1. We denote by T n the cr-algebra generated by the projection P n {x) — (aci, • • • , x n ) 
and by Ej™ the corresponding conditional expectation. Everywhere below || • | 
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means the standard / 2 -norm (finite and infinite dimensional). We denote by V and 
D 2 the derivatives of first and second order along H = 1% respectively. For every 
linear operator A : 1% — > 1% the notation ||A|| means the standard operator norm and 
= ^/Tr(A* A) the Hilbert-Schmidt norm. The time derivative of a function 
/ is denoted by /. We fix the standard orthogonal basis in R°° consisting of vectors 
= (Sij)jeN- We use the word "positive" in the sense of "strictly positive" (i.e. 
" > 0" ) , otherwise we say " nonnegative" 

2. The Gaussian case 

In this paper we use the following core of smooth cylindrical functions: C is the 
linear span of all infinitely differentiable functions <p(x\, • • ■ ,x n ) depending on a 
finite number of coordinates and having a compact (considered as functions on R") 
support. 

Remark 2.1. (i) The use of functions of the form ip(xx, • • • , x n ), p £ C^°(R n ), 
is natural for R°° , but differs from the standard core in the Gaussian case, 
where ip usually depends on a finite collection of measurable functionals 
Xhi, hi £ H, which are A^(0, ||/ii|| 2 )-distributed. 
(ii) Clearly, C separates the points of R°° . Furthermore, a simple monotone 
class argument shows that C is dense in any L p {v), p £ [l,oo) and any 
finite measure v on R°°. 

Let v be a probability measure on R°°. 
Throughout the paper it is assumed that 
HI) all the projections 

v n = vo p- 1 , 

where P n {x) = (x\, ■ ■ ■ ,x n ), have Lebesgue densities. 
H2) for every i £ N there exists a function fa £ L 1 ^) such that 

d ei ip dv = — J ip fa dv. 

for every ip £ C. 
fa is called logarithmic derivative of v along 

Remark 2.2. Note that these assumptions are not independent: H2) implies HI) 
for n = 1. 

Remark 2.3. It is important to keep in mind that the projections v n also have 
logarithmic derivatives given by the conditional expectations TSL^ n fa. 

We say that a mapping b : R°° — > R°° has divergence div„& £ L x (v) if the 
following relation holds for every (p £ C: 



(5) / div„6 ip dv = — J (b, Vip) dv. 

For an account in infinite-dimensional analysis on spaces with differentiable mea- 
sures the readers are referred to [8], [9]. 

We study ©, where p = p(t,x) is a family of probability densities with respect 
to v with initial condition p(0, •) = poi i- e - w e are looking for solutions p(t, x) given 
as densities of a family of probability measures fit(dx) — p(t,x) ■ v(dx). 
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Definition 2.4. We say that p is a solution of (T2|) for t £ [0, T] with initial value 
po if for every ip £ C and t £ [0, T] one has 

(6) / ipp(t,x) dv — I ipp(0,x) dv + / / (b,\7Lp)p(s,x) duds. 



Remark 2.5. The solution in the finite-dimensional case is defined in the same way. 

Remark 2.6. We note that the existence of the right-hand side is not obvious be- 
cause it is not clear a-priori that (b,Vip}p(s,x) £ i 1 (/[ ,t]^ s x v )- Nevertheless, 
we will see in the following Lemma that this is indeed the case if c defined in Q 
satisfies some natural assumptions. 

The following result has been proved by Ambrosio and Figalli in :7i (Theorem 
6.1) for po £ L°°(7). The proof of this result is the same and so we omit it here. 

Lemma 2.7. Consider the standard Gaussian measure 7 on M. d . Let ||c|| £ L p {^j), 
p > 1 and || exp(e(div 7 c)_) ||x,i( 7 ) < 00 for some e > 0. Then for any po £ L q (7) 
with q' > q = = p* there exists T = T(e,p, q') > such that the equation 

p + div 7 (c ■ p) =0 
admits a solution p on [0,T] satisfying sup tg [ 0T j ||/9t||i,<i( 7 ) < 00. 

Let us give the idea how to control the L p -norms of pt via div 7 c needed in the 
proof of Lemma 12.71 Below we set for brevity 



p t =p(t,-), and X t :=X(t,-) 

(see {3])). The well known change of variables formula for the mapping x — > X t (x) 
is given by the Liouville formula: 

Pt(X t ) = po ■ cxp(- J div 7 c(X r ) drj = p s (X s ) exp(^- J div 7 c(X r ) drj . 

One has for any q > 1 

P 9 t dry = J p1~ 1 {X t )p Q dj = J p q exp(~ J (q- l)div 7 c(X r ) dr) d'f 

< — J J Pq exp^— t(q — l)div 7 c(X r )^ dj dr. 

Applying the Holder inequality and change of variables one gets that A(t) = 
J * J pi d-f dr satisfies A' < C(A/t) s for some S > 0. By the standard arguments 
one gets that A' is uniformly bounded. This finally gives the following key estimate 
for pt: for any q' > q there exist positive constants gi(g, q'), (72(9, q') such that 

(7) J P q t d~f < ^ pi djj q J J exp(g 2 *[div 7 c]_) d-f dr. 

Clearly, under assumptions of Lemma 12.71 the right-hand side of the inequality is 
finite for sufficiently small t. 



Lemma 2.8. Let c satisfy the assumptions of Lemma \2.7\ and f be a bounded 
Lipschitz function: 

/ Lip = Sup n < OO. 

x? y \\x-y\\ 

Then the solution p t obtained in Lemma \2. 7| satisfies the following property: 



1 ) If, in addition, 

div 7 c G L N (~/), ||c|| G i p '(7) for some N > q* , p' > p, 
then there exist positive constants C, d depending on 

P,q,p',q', l|c|| LP '( 7) , ||div 7 c|| L iv( 7) , || exp(e(div 7 c)_)|| L i (7 ) 
l|Po|| i9 '( 7 ), sup |/|, |/| Lip 



such that 
(8) 



fp] +5 d 7 - fp\ +5 d 1 



< C\t-s\, for all t,s€ [0,T]. 



2) Without any extra assumption there exists a positive constant C depending on 
P,v', l|c|U»( 7 )! II exp(e(div 7 c)_)|| L i {7) , \\ Po\\ Lq > {l} , \f\u p 

such that 



(9) 



fp t d-y- fp s d-f 



< C\t-s\, for allt.se [0,T]. 



Proof. We proof only 1) because the proof of 2) is easier and follows the same 
line. In the same way as in [7] we reduce the proof to the case when X(t,x) is a 
globally defined smooth solution to X = c(X), X(0, x) — x. We apply the change 
of variables formula for the mapping x — > X t (x). Let s < t, 5 > and / be a 
bounded Lipschitz function. 

,1+8 



p L t +0 fdj= / p d t (X t )f(X t ) Po d 1 



p 5 s (X s )f(X t )cxp(-S j div 7 c(X r ) dr)p d 1 

p s s (X s )f(X t ) d 1 + J p s s {X s )f[X t )[exp{-8 J div 7 c(X r ) dr) - l]po d 1 
= J P\ +S f dj + J p s s (X s )(f(X t ) - f(X s )) po d 1 
+ f p 5 s (X s )f{X t )[exp(-5 f div 7 c(X r ) dr) - l] Po dry. 



Here we use that p s ■ 7 is the image of po ■ 7 under x -+ X(t,x). Note that 
|e~* — 1| < u(t), where u(t) = e max '-~ t ' '\t\. Since / is convex one can apply the 
Jensen inequality. Then the last term in the right-hand side of the above inequality 
can be estimated by 

J p 5 s (X s )J u(8(t-s)di^c(X r )) dr po d 1 . 

The latter can be estimated by 

1 



sup l/l 



,91 8+1 



d~{ 



-t - 
< sup l/l 
- 1 



t 



i q± (S(t — s)div 7 c) p r dr dj 

Pl lS+1 d 1 
i q i q " (6(t- s)div 7 c) dr d 1 
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p q r dr dj 



Applying again the Holder inequality and Lemma 12.71 (see (J7J ) it is easy to show 
that the latter does not exceed C\t — s\ for some C, e where t — s,S are chosen 
sufficiently small and q* close to 1. 
Analogously, we estimate 



p s s {X s )(f(X t ) - f(X s ))p d 7 < ||/|| Lip / p s s (X s ) / ||c(X r )|| drp d 1 



<(f-*)H/|lLip 



,P2<5+1 



<(*-*)ll/llup 



2<5 + l 



1 



t - s 

\\c\\ p ' 2P d-f 



p r drd'y 
1 



t 



p q r drd'y 



Choosing p* 2 close to 1 and a sufficiently small 5 we get the desired result. 



□ 



Remark 2.9. Below we generalize the existence result of [7] in infinite dimensions 
which has been established under the assumption that ||c|| e L p (-y), p > 1. We 
prove it under the weaker assumption Cj € LP (7). Furthermore, we work with our 
slightly weaker notion of solution from Definition 12.41 above. 



Lemma 2.10. Let v = 7 = fL_i7i ^ e a P r °duct of the standard Gaussian mea- 
sures. Assume that c = (ej) : R°° — ¥ R°° is a mapping satisfying: 

1) There exists p > 1 swc/i £/ia£ Cj € L p ("f) for every i. 

2) TTie divergence div 7 c satisfies 

(10) exp(e(div 7 c)_) e L x ( 7 ). 

TTien £/iere exists T = T(e,p,q') > sttcft that the equation p + div 7 (c • p) = 
/ias a solution pt on [0, T] /or every initial condition pa g L 9 (7) urai/i some q' > 
-Hj. =p*. /n addition. sup te[0T] ||/?(f, -)||i«(7) < °°- 

Proof. Let us set: 

n 

C(„) = X! E 7" Cl ' e * 
i=l 

and 

• (7 P^) = (PO ■ 7) Pn 1 - 

Equivalently, po. n = JE^ n po- Note that assumption 1) ensures that C( n ) is well- 
defined. 

It is well-known (and easy to check) that 

div 7 C(„) — E^" [div 7 c]. 

This relation easily implies that [div 7 C/ n )]_ < E?"[div 7 c]_ and |div 7 C(„)| m < 
E^»|div 7 c| m , m > 1. 

By convexity and Jensen's inequality one has 



Consider the equation 



p n + div 7 (p„ • c (n) ) = 
7 



with p n \ t =o = p a ,n- Since ||/3o,n|liy( 7 ) < ll/°o|| L ,'( 7 ), we get by Lemma ES that 
there exists T = T(e,p,q') > such that this equation admits a solution p n on 
[0, T] satisfying the following dimension-free bound 

M= sup \\p n (t, •)IU«(7) < °°- 

tS[0,T],n6N 

For any function tp £ C, the following identity holds: 

(11) / ipp n (t,x) dj= ipp n (0,x) dy + / (c„, V(p)p n (s,x) dy ds. 



Applying a diagonal argument one can extract a subsequence (which is denoted 
in what follows again by p n ) such that {p n (tk, x)} converges weakly in £'(7) to some 
function p(tk,x) for any t& from a dense countable subsequence / = {tk} C [0, T]. 
Then x)} converges weakly in L g (y) for any t £ [0, T] to a function denoted 

in what follows by p(t,x). Indeed, since 

swp\\p n (t,x)\\ Lq ^ < 00, 

n 

by a standard subsequence argument it is enough to show that {J fp n (t,x) dy} is a 
convergent sequence for every / 6 L p {j)- Clearly, it is sufficient to check the claim 
for functions from C. Since for such a function / £ C the sequence {J fp n (tk, x) dj} 
is convergent for every tk € /, it follows easily from the estimate (|9j (we use here 
that / is cylindrical, hence the right-hand side of ((9J depends on a finite collection of 
Cj) that {J fp n (t, x) dj} is a Cauchy sequence. Thus, we get that p n (t, x) — > p(t, x) 
weakly in L q (^) for every t £ [0, T]. 

One has for every smooth cylindrical function tp = <p{x\, • • ■ , x%) 

lim^ J ipp n (t, x) dy — J <pp n {0, x) d'yj = J ipp(t, x) dy - J <pp(Q, x) d'y. 

Set: g n (s) = J(c n ,'V(p)p n (s,x) d~f. Using the convergence cr n \ — > c in L p (~f), one 
gets 



lim.g„(s) = lim / (c„, Vip}p n (s, x) d'y = (c,V(p)p(s,x) dy = g(s) 
™ n J J 

for every s £ [0, T]. Clearly, 

sup \g n (s)\< sup ||p„(s,-)||L9( 7 )||£fc°c|| LP(7) ||V^|| L oo (7) . 
se[o,r],neN se[o,T],neN 

Then the Lebesgue dominated convergence theorem implies 

(c n ,Vip)p n (s,x) dy ds — >• J J \c,V (p) p(s,x) dv ds. 

Passing to the limit in (fTT|) we get that p is the desired solution. □ 

Before we proceed to the general case, let us explain the main idea of the proof. 
We construct a mapping T pushing forward another measure p onto v. If T is 
sufficiently smooth, one can define the following new drift: 

c = DS(T) ■ b(T), 

where S is the inverse mapping to T. One has 

p = v o S^ 1 



and 

{DT)- 1 = DS{T). 
Let us give a heuristic proof of the key relation: 

(12) div^c o S = div„6. 
Take a test function ip £ C. One has 

(13) J (Vf, c) o S dv = J (Vip, c) dp, = — J ip div^c dp = — J <p(S) div^co S dv. 

On the other hand, we note that by the chain rule 

(14) Vfo>(S)) = (DS)*V<p(S). 
Hence J (V<y9, c) o S dv is equal to 

(15) / {(DS*y x V{ip{S)),c{S)) = / (V(ip{S)),b) dv = - [ ip(S)div v b dv. 



Obviously, (|T3| and imply (fl2|) . 
Now let us try to solve the equation 

p + div„(p • b) = 

for a wide class of probability measures. Assume that v is the image of the standard 
Gaussian measure 7 under a mapping T. Setting c = DT~ X ■ b(T) = DS(T) ■ b(T) 
we transform the equation into 

(16) ff + div 7 ( ff -c) =0, 

where every p ■ p is the image of g ■ 7 under T. Applying Lemma 12.101 we obtain a 
solution to (TTdT). Then the function 



p{t,x) = g(t,T'\x)) 

presents the desired solution. This follows immediately from the definition of solu- 
tion in Definition 12.41 and the change of variables formula. 



3. SOBOLEV ESTIMATES FOR TRIANGULAR MAPPINGS 

Consider a Borel probability measure p on R°°. We denote by pi = p Pr 
the projection of p onto the subspace generated by the first i basis vectors. Recall 
that throughout the paper pi is assumed to have a Lebesgue density, which will be 
denoted by p^. For every fixed x = (x%, • • • , £i-i) we denote by p^: i the corre- 
sponding one-dimensional conditional measure obtained from the disintegration 
of pi with respect to fii—i. Note that pi-\ = pi o Pj~ x . These measures are related 
by the following identity 

ipp^dx — J ip(x,Xi) pi(dxdxi) = J (^J tp(x,Xi)p.^ ti (dxi)j p l - 1 (dx) 

for all bounded Borel cp : W — >• R. 

If for /^-almost points x the corresponding conditional measures p^: i have Lebesgue 
densities, they will be denoted by p^± . In this case the latter formula reads as 

(pp^dx = ( viXiX^P^dxA p^^x) dx. 



In this section we study a-priori estimates for so-called triangular mappings. We 
call a mapping T : R°° — > R°° triangular if it has the form 

oo 

T = y^Tj{xx,--- ,Xi)ei 

4=1 

and, in addition, Xi — > Ti(xi, ■ ■ ■ ,Xi) is an increasing function. 

Given two probability measures p and v on M°° we are looking for a triangular 
mapping T : R°° — > R°° pushing forward p onto v. The proof of existence of 
mappings of such type on R°° for a broad class of measures can be found in ( [T2] , 
[8]). It relies on the fact that T can be precisely described in terms of conditional 
probabilities of p and v. In the one-dimensional case T = is defined by the 
relation 

fx rT(x) 
/ p„(t) dt = / Pv {t) dt. 

J —oo J— oo 

In the finite- and infinite-dimensional case T is obtained by induction 

1) T\ is the increasing transport of the projections on the first coordinate 

(17) T\{x\)=T iiuVx {x\) 

2) Tj, i > 1, is the increasing transport of the one-dimensional conditional 
measures ft, v: 

(18) Ti(xi,--- ,Xi) = Tjif(xi), 
where p, = p,^, v = v^_ i{x) v x = (an, • ■ • ,^-i). 

The existence result and the basic properties formulated in the following theorem 
have been proved in the ([H], [12]). 

Theorem 3.1. Let /i be a probability measure on R°° satisfying the following as- 
sumptions: 

1) Any projection pi, i £ N, is absolutely continuous measure with respect to 
Lebesgue measure on W . 

2) For pi-almost all x the corresponding conditional measures p xi are abso- 
lutely continuous, with respect to Lebesgue measure on R. 

Then there exists a triangular mapping T pushing forward p, onto v . The mapping 
T is unique up to a set of p-measure zero. 

Let v be another such measure. Then there exists a triangular mapping S pushing 
forward v onto p. In addition, they are reciprocal: 

T o S = Id v - a.e. 

S o T = Id p, — a.e. 

Remark 3.2. The absolute continuity of p^: i follows, in particular, from the absolute 
continuity of the conditional measures of p in the direction . The latter follows in 
turn from the existence of the logarithmic derivative /3 4 . In particular, the measures 
satisfying our general assumptions from Section 2 do satisfy the assumptions of 
Theorem O 

Note that in the one-dimensional case T and S are just non-decreasing mappings 
which can be written exactly in terms of the distribution functions of p and v. Hence 
T (S) admits classical pointwise derivative T' p (^)-almost everywhere. One can 
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easily check that T' is /z-a.e. positive, because otherwise v has a non-trivial singular 
part. In particular 

T'(S)-S' = 1, n-a.e. 

Remark 3.3. Since every Tj is constructed as a one-dimensional increasing trans- 
portation of conditional measures, the following generalization of the above relation 

d^S) ■ d Xi Si = 1 M-a.e.. 

is valid in the finite- and infinite-dimensional case. 

If T and S are smooth (meaning that every function Ti, St is smooth) then their 
Jacobian matrices are triangular: 



DT = 



( d Xl Ti 

d Xl T 2 & X2 T 2 

$x i T% $x 2 T% &x 3 T^i 

/ d Xl Sx 

d Xl S 2 d X2 S 2 



DS 



V 



d Xl Si d X2 Si d X3 Si 



9 Xi _ ± Si d Xi Si 



\ 



In addition, if all d Xi Ti(x) ^ at x (which happens /i-a.e.), then 

DS{T[x)) = (DT)- 1 . 

In this section we establish global Sobolev estimates for triangular mappings. 
Note that some (dimension-dependent) Sobolev estimates for the triangular map- 
pings have been obtained in . See also [25] for similar results on optimal trans- 
portation. 

Definition 3.4. Let v be a probability measure on K°° and / £ L}{y). We say 
that d Xi f is a Sobolev partial derivative of / if ffa £ I^iy) and 



J d x J ■ <p dv = - J f ■ d Xi ip dv - J 
for every ip £ C. Obviously, this defines d Xi f uniquely. 



ftp fa dv 



Definition 3.5. Let p £ (l,oo) and v be a probability measure such that fa £ 
LP (v) for all i. We say that a function / belongs to the Sobolev class W 1,p (v) if 



2 \l/2 



L*{y) 



< 00. 



If, in addition, every f Xi has all Sobolev partial derivatives in L p (v) and 



'■3 



we say that / £ W 2 ' p (v) 



Remark 3.6. Though we shall not use this below, it follows by [3] that due to the 
assumption that fa £ L p " (y) for all i € N both W 1,p {v) and W 2 ' p (v) are complete. 
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All derivatives of S and T below will be understood in the Sobolev sense, with 
respect to v and \x respectively. 

We start with a one-dimensional estimate. 

Proposition 3.7. Let \i = e~ v dx, v = e~ w dx be two probability measures on 
R with continuously differentiable functions V and W . Consider the increasing 
mapping T pushing forward \x onto v. Assume that 

xW'ix) > 

p-l 

for some p > 1 and e > 0. Assume, in addition, that \x\ P ''"* ' G L x (u), \V'\ P+S G 
L 1 ^) for some 5 > 0. Then T := DT G U>(jj,). 

Proof. By the change of variables formula f* oo e~ v ^dt = e~ w ^ dt. Clearly, 

this implies that T is differentiable and 

e -V =T , e -W(T) m 

Moreover, T' = e w ( T )- v [ s continuously differentiable and satisfies 

T" = T'(W'{T)T' - V'). 
Take a positive test function £. Integrating by parts one obtains 

f (T'fe- V i dx = 
Jr 

= -(p-l) f T{T'f- 2 T" er v £, dx+ f TiT'Y^V 1 e~ v i dx - f {T') p ^e~ v i' dx 

JR it JR 

= -(p-l) f TW'{T)(T'Ye- v £,dx+p f T{T') p - V'e"^ dx - f (T') p ~ 1 e~ v i' dx. 

JR JR JR 

Note that 

e f {T') p e- y £ dx<-p f T(T') p - V'e"^ dx - f (T'f^e^i' dx 
Jr Jr Jr 

< | / (T') p e- V £ dx + N(e,p) f T p (V') p e - v £ dx - f {T 1 f' 1 e~ v dx. 

2 JR JR JR 

By the Holder inequality 

f T p {V') p e- v £ dx< f (V') p+S e- V £ dx + C{8,p) f e~ v £, dx 

Jr Jr jr 

= [(V'r+ 5 e- v Zdx + C(6,p) f W^e-^dx, 
jr Jr 

- f (T'y-'e-v? dx<-\ {T') p e~ v i dx + c(p,e) f 
JR 4 jr jr s 

Thus, we obtain a bound for j R {T / ) p (V / ) p e- v ^ dx. Taking a suitable sequence 

f p 

{£n} with i n — > 1 and lim„ J R ^ £ n e~ y rfa; = we complete the proof. □ 

Now let us come back to the infinite-dimensional case. Below in the proofs we 
apply the following scheme. 

1) Prove the statement for smooth positive densities. 

2) Approximate the Sobolev densities by smooth positive densities and deduce 
the desired estimates. 
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• P ie- y dx. 



First, we need an approximation lemma. 



Lemma 3.8. For every probability measure v = p dx on W d satisfying Pi = ^y- € 
V{y) for some p > 1 (with := on {p — 0}) and all 1 < i < d there exists a 
sequence of probability measures v n — p n dx such that 

1) v n — > v in variation norm, 



2) lim„J 



VPn 



p n dx = j 



111 

P 



p dx. 



3) every p n is smooth and nonnegative, 

4) the partial derivatives d Xi p n are uniformly bounded and integrable for every 
i, n. 

Moreover, if every logarithmic derivative Pi has Sobolev derivative along any coor- 
dinate Xj and, in addition, there exists p > 1 such that 



d Xj pi = 



<l, .! l> d Xi p-d x p 



p p" 
then item 2) can be strengthened as follows: 



e l p (p), p t e l 2 p{v) 



lim 



Wpn 



J 2p 



dx 



Vp 



2p 



p dx, lim 



Pi 



dx 



p dx. 



Sketch of the proof: The proof is quite standard and we only give a sketch 
here. It consists of two steps: 1) approximate p by fn'P, where {f n } is a sequence of 



smooth nonnegative compactly supported functions satisfying sup^ 
for every n such that ip n — > 1 pointwise; 2) approximate compactly supported p 

d _ 3,2 

by the ordinary convolutions with smooth kernels (27rf)2 e~ ^ . For 2) we apply 
Jensen's inequality and Fatou's lemma. 

Remark 3.9. It is straightforward to check using (jTT)) . (TT5|) that T and S are contin- 



uously differentiable, p 



dx, v 



-w 



dx, and V, W have uniformly bounded 



derivatives. Note that in this case all conditional measures have positive densities 
and all the derivatives d Xi Si, d Xi Ti are positive. More precise statements about the 
regularity of triangular mappings can be found in [H] (Lemma 2.6) and [25] . 

Proposition 3.10. Consider the triangular mapping S pushing forward v onto 
7. Assume that Pi € L 2 (v) for all i. Then for every i the mapping Si belongs to 
W 1,2 (v). In particular, the following estimates hold: 



(d Xi Sifdv < J (TE^Pi 



dv, 



In particular, 



(d Xj Si) 2 d V < j ^p^dis- j(\E^p 3 y d V , 



% > j. 



Proof. First, we note that due to the finite-dimensional structure of triangular 
mappings it is sufficient to establish the statement for finite-dimensional measures. 
We start with the case when p u — e~ v , where V is a smooth function on M. d with 
uniformly bounded derivatives. 
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In the proof we apply the following relation between the logarithmic derivatives 
and conditional densities of the corresponding projections 

tt 9 Xi p u ±, 



Pu^ 

We keep the notation p 7 for the Lebesgue density of the 1-dimensional standard 
Gaussian measure 7: 

1 _£ 

Pi 



/2ir 

According to Remark 13.91 all the functions Si are continuously differentiable and 
d Xi Si > 0. It follows by the change of variables formula that J Sf dv = J x\ d'y, 
hence 5, € L 2 (v). This implies that d Xi Si £ L x (v). Indeed 



d Xi Si dv = ~ J SiPi dv < ||5i|| L 2 M || / 8j|| L 2( J/ ). 

Let us estimate J(d Xi Si) 2 dv. One has the following explicit formula for Si (we 
stress that the expression below makes sense because p v is positive as well as the 
densities of its projections and conditional measures) : 

pSi(x 1 Xi) r-Xi 

(19) / p 7 (t) dt= / p v ±Xt) dt, 



where 

M Pu t (x,t) 
P»± At) ~ 



I-00 P»i ( x > *) dt 
Differentiating (JTHJ) along one obtains 

(20) J^L-=d Xi Si. 

Formally applying integration by parts we get 

({d Xi S t ) 2 dv = [d Xi Si-^r dv = - I ' d Xt S % ■ Sf^%- dv 
J J P~{{Si) J P~f{S t ) 

d x t Pu^ z f 1 ( d XiPv^ t \ 2 Pv^t 

Si ; „ x ;' dv < — — — ( — — dv = 



V(— ) 

V p ± / 

J vj^) dv = J ^ 



Pi (Si) J d Xi Si V p v i_ j I pj(Si) 

dxiP^. \ 2 



To justify the above computation we integrate not over v but over £ • v, where £ is 
a compactly supported smooth function on Mr. By the same arguments one gets 

J {d Xi Sift dv<(l + e)J (Ef«A) 2 e dv + c(e) J (^) 2 £ dv. 

Choosing "an appropriate" convergent sequence — > 1 with lim^ f( -^-^ ) 2 ^k dv — 
one easily gets the desired result. 
Analogously, one has for d x . Si, i =/= j: 

I-00 d *j Pvi fa> *) dt I-oc Pn 0> t) dt f-oa d *j P"i ( x ' *) dt ' 



P 7 (Si)d Xj Si 



J^p Vi {x,t)dt (j \ /)( . ,:,-./, ,//)' 
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(21) 



Denoting the right-hand side by / one gets 

d Xj Si _ f 



Consider the following formal computations 

d XiSi r2 7, j /" [ 9 Xi Si 2 2 



s/ ^i-i dXi = - I / — 7W\Sif dvi-i dxi 

Pj{Si) J J Pj(Si) 

~ 2 / / % \ ff x * d v i-\ dxi < [ [ — - J Xi dvi-i dxi 
Pj(Si) J J p 1 {S l )d Xi S l 



f fx, \ 2 dv , = ff d X] p Vi _ JZyd^Pv^t) dt y 
\P„± J 1 J \ p Vi n,P^(i,i) dt ) 



dv 



Ef ft - Ef^ft) 2 dv = j (E^ft - Ef'- 1 ^) 2 
E^ft) 2 di/- y (Ef-ft) 2 dzA 

To justify the global integration above we integrate again with respect to £ • v, 
where £ is a compactly supported smooth positive function on R''. Repeating the 
above arguments one gets 

J (d Xj Si) 2 £, dv = - J Si ■ d Xj Si ■ d Xi £ dv - J J ~^j S ff 2 ^ dv l ^ 1 dx l 

S- 

-Jfxii dv^i dxi 



PiiSi)' 

The term — J Si ■ d Xj Si ■ d Xi £ dv can be estimated by 



ef(d Si Sift dv+-J S>{^)\ dv. 



Finally 

(1 - e) J (d X3 S t ) 2 t dv< J (Ef'ft - ii-:;' ft) 2 £ dv+-J S?(^ft dv. 

Estimating the term J Sf (-^p) 2 £ dv by the Holder inequality and choosing an 
appropriate sequence — > 1 we complete the justification of the above formal 
computation. 

It remains to approximate an arbitrary density p on M. d with J f3f p dx < oo by 
smooth densities and prove that the desired a-priori estimate is preserved under 
taking the limit. Indeed, let pn<.\ = e~ Vk be approximating densities constructed in 
Lemma T3. 81 Let S^ k ' be the triangular mappings pushing forward pnA dx onto 7. 
Note that the functions S"f } • p (k) are in W 1 ' 1 (W l ). Indeed, 

J \\DSW\\ HS p W dx<[J \\DSW\\% 8PW dx]'' 2 <J2j i^fPik) dx < ||VPwll^. 2 (a») 
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and 



l^i ll|Vp (fe )||da;< 



\s[ k) \ 2 P[k) dx 



1/2, 



x, dj 



1/2, 



/P(fc) II VK 1 



Using Sobolev embeddings and extracting an almost everywhere convergent subse- 
quence one can assume from the very beginning that fiK" ■ pnA converges almost 
everywhere. Using that pify converges almost everywhere to p, one can easily see 
that S^ converges to a triangular mapping S at i^-almost all points. Using almost 
everywhere convergence it is easy to check that S pushes forward p onto 7. From 
the almost everywhere convergence and the following change of variables formula 



|S< fc >| 



-P(k) 



dx 



dj 



J WSfpdx, 



one gets that S 1 -^ ■ y/p(kj converges to S ■ yfp in L 2 (R d ). Applying the estimates 
above one proves that VSf } • y/pjkj converges (up to a subsequence) weakly in 
L 2 (W l ) to a vector field v . Standard integration by parts arguments show that v 
can be identified with VSj • ^fp. Indeed, 



<P ■ d Xl S {k) p {k) dx=- J d Xi <p- S {k) p ik) dx- J <p S ik) ■ d Xt p {k) dx. 
The left-hand side converges to J ip ■ v^fp dx and the right-hand side to 

— J d Xi ip ■ Sp dx — J ip S ■ d Xi p dx 
(this follows from the strong convergence of ■ y/pjk) an d ^P(k)/y/Plk : ))- Hence 



\d Xt S\ 2 pdx<lun k J \d Xt S^\ 2 p (k) dx<]imJ2 j^Pfpik) 



dx 



E 



(0i) 2 pdx. 



The other estimates can be justified in the same way. Hence the proof is complete. 

□ 



Remark 3.11. It is clear, that formula (|20[) remains true in the non-smooth setting, 
for instance under the assumptions of Proposition 13.101 We understand d Xi Si as 
the Sobolev derivative or just as the classical derivative of the one-dimensional 
increasing mapping x% — > Si. Taking product from i = 1 to d in (|20[) we obtain the 
change of variables formula 



(22) 



Pr 



(27T) 



-d/2„-i|S| 2 



det DS = Y[p^(Si)-d Xi Si 



Remark 3.12. In what follows we will give a proof for a-priori estimates only in 
the case of smooth and positive densities. The complete justification for Sobolev 
densities can be spelt out as in the proof of Proposition ^. 101 

In particular, note that since all the densities are positive and smooth, all the 
expressions in the intermediate computations are well-defined. 

We also note that in the general (i.e. Sobolev) case d Xi Si remains positive v- 
almost everywhere, because d Xi Si = implies that the corresponding conditional 
density of v vanishes, which can happen only on a set of ^-measure zero. 
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Remark 3.13. Another estimate of this type has been mentioned (without rigorous 
proof) in 251 

Moreover, if the image measure /i is not Gaussian, but uniformly log-concave, i.e. 
has the form \i = e~ w dx with D 2 W > K ■ Id, K > 0, then 

'd XiXk Sk N 2 



[ $ dv>K f\\d Xi S\\ 2 dv + ^j 



d Xk Sk 



dv. 



Remark 3.14. One can easily generalize Proposition 13 . 101 to the i p -case. Under the 
same assumptions for every p > 1 there exists C = C{p) such that 



and 



J(d Xz S l ) p dv<C(p) J \JE?0i\ P dv 
J \d Xj Si\ p dv < C( P ) J |]Ef«& - K'-'fo 



dv. 



The proof follows along the same line of arguments as above. 

We prove some L p -estimates for higher order derivatives. Taking logarithm of 



both sides of the identity - ^ — d Xi Si and differentiating the result along Xj one 
gets 

<>r P v d x x Si 

1 +Si-d Xj Si = 



Pv 1 



Hence 



dx i x a Si — Ox ■ Si 



Pu 1 



d x - Si 



Si • d Xi Si ■ d Xi Si 



Then applying the standard Holder and Jensen inequalities and using that S% £ 
L N (rj) for every N > 0, we get trivially the following bound. 

Proposition 3.15. For every p > 1 and e > there exists C(p,e) such that under 
assumptions that /3k £ L p (v) for all k, one has 



d Xi Si 



Lp(u) 



and 



H^Xj^HiP^) < C(p,e) WWWlt+.M ■ \\\P 3 \ P \\ L 2 + e {u) . 

See also Remark VS.l'A 

It remains to estimate \\d Xj x m Si\\T,p( u ) for j ^ i,m =/= i. 

Proposition 3.16. Let j < m < i and p > 1. Assume that (3j,f3 m £ L 2p (v) and 
f3 'j admits partial Sobolev derivative d Xm /3j £ LP(y). 
Then there exists C(p) such that 

\d XjXm Si\ p dv < C( P ) f (pf + 0% + \d Xm ^\ p ) dv. 
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Proof. In the same way as above the proof is reduced to the case where the densities 
are smooth and positive and admits integrable derivatives (see Proposition 13. 10]) . 

For simplicity let us consider only the case p = 2. We use relation (|2"Tj) : d x . Si = 
d Xi SijL- with 

(23) f = SHod XjPvi {x,t) dt _ Pvz (x, t)dt d x . Pv . (x, t) dt- 



Differentiating d Xj Si = d Xi Si along x m we get 



ox 



dxiX m Si 



d T - Si dx-n 



Pv± 



d XiXm Sj dx ™Pvji q Q 5 Xm / 



The bounds for the first two terms follow immediately from the previous estimates. 
Let us estimate f (d Xi Si p*"™^ ) • One has 

(d Xl S l ) 2 ^- dv = [ 9x ^\ ^ Xm dv= [ [ dx f Q l . fl m dv^ x dxi 

dx, fa\ S ifx m dv i-i dx '<- - 2 [ [ Ta\ fx m fx i x m dui-i dx t 
Pj{Si) m J J p 7 {Si) 

-II p J1)£a dXi Kt^) 2 d "- 

Differentiating (|2Uf one gets 

/., <>.:■.. , /',, d X] p v J x ca d Xm p^{x,t)dt d Xm p v J X ao d X] p v% {x,t)dt 



Pv^ Put Put fToo Pvi (x, t) dt p Vi p v . (x, t) dt 

JT 00 d XjXm p Ul (x,t)dt 2 J™ 00 d Xm p^(x,t)dt f^doojPv^x^) dt 



JZp^t)dt - (/^^(M)*) 
Arguing as above, we easily get 

e(^) 2 <cj + f& + (d^r) dv. 

Hence the proof is complete. □ 

4. Transfer of solutions 

We consider in this section a probability measure v on the space X, where 
X = M. d or X — R°°. We denote by 7 the standard Gaussian measure if X = R d 
and the product of the standard Gaussian measures on R 1 

00 

»=i 

if X = R°°. 
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Everywhere in this section S is the triangular mapping pushing forward v onto 
7. As usual, we set: T = S^ 1 and c = DS(T) ■ b(T). 

It will be assumed throughout that v admits logarithmic derivatives j3i € L p (y), 
i G N, at least for some p > 1 (independent on £). Thus by Remark 13.141 the 
functions are all Sobolev, more precisely 5, G L n (v) for all n and |VSi| G L p {y). 

We also apply systematically the 

Chain rule: for every / G W 1,p (v) and every smooth compactly supported 
function ponK one has ip(f) G W 1,p {v) (see Lemma 2.6.9 [5]). 

We also need the following important fact (see Theorem 2.6.11 [5]). 

Theorem 4.1. Assume that d < 00, p > 1. TTie sei of smooth compactly supported 
functions is dense in the weighted Sobolev space W ,p {v), v = p dx provided logp G 
W^ p (u). 

Everywhere below pi is the logarithmic derivative of v along a. 

Lemma 4.2. Assume that X = R°° and fa G L p {v), bi G L p (v) for some p > 1 
and z. Assume that div„6 G L l (v). Then every Ci G £ (7) and, in addition, c 
admits a divergence and the following relation holds: div 7 co S = div„6. 

Proof. It is easy to see that for every <p G C£°(M. n ) the function <p(S) is cylindrical 
and belongs to W l *{y) by the chain rule. Clearly Vip(S) = {DS)*V(p(S). By 
Theorem 14.11 there exists a sequence of C^°(R")-functions {ipk} such that ipk —> 
<p(S) and Vi/>fc —> VUp(S)] in L p (v). This, in particular, implies that the relation 
— J f ■ diVj/6 dv = /(V/, b) dv holds for / = ip(S). The assumptions of this lemma 
now imply that a G L l (^) for every i. Hence 



[(V<p,c)d 7 = J{V<p,c)oSdu = J \{DS*)- 1 V{f{S)),c{S)) 



dv 

(VQp(S)),b) dv = - J ip(S)div u b dv = - / ip div„6(T) dry. 

This implies the last assertion. □ 

Lemma 4.3. Assume that X = R d and (i, G L p (v), for any p > 1 and all i G 
{1, • • • , d}. Assume, in addition, that — is locally integrable in any power. Then 
for every <p G Co°(R n ) there exists a sequence of (K™)- functions {ipk} such that 
ipk — > <-p(T) and Vipk — > V[y(T)l ^-almost everywhere and weakly in £"(7) for any 
n>l. 

Proof. In the same way as in Lemma rOl we show first that ip(T) belongs to W 1 ' n ( / y) 
for any n > 1. Then one can find approximating sequences for any n and choose 
the desired one by a diagonal argument. To show that <p(T) belongs to W 1,n ( 7 ) we 
apply Remark \3. 141 One gets that the functions (DS*ei,ej) are integrable in any 
power. One has 

J \\V[<p(T)]]\\ n d 7 < J \\DT*.V<p(T)\\ n d 7 = J \\(DS*r l -V<p\\ n dv. 

It remains to show that all the functions ((DS*)~ 1 ei,ej) are locally i/-integrable 
in any power. Since (DS*ei,ej) admits the same property, we only need to show 
the local integrability of ^ de ^ s ^ n ■ Taking into account that = (sec |12D ) 

and the assumptions of this lemma we only need to show that S is locally bounded. 
Applying again the local integrability of y- we get that S belongs to the standard 
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local Sobolev class W, J? for every p. Then the boundedness of S follows from the 
classical embedding theorem. □ 

Proposition 4.4. Assume that X = M°° . Let p(t,x) be a solution to the equation 
p + div t/ (p ■ b) = 0. Assume that there exists p > 1 swc/i i/iGrf sup te r 0T i \\p(t, •) • 
fri||z>*0) < 00 anf 4 * n addition, Pi G L p (v) for all i. 

Then the function g(t,x) defined by the relation g-j = (p-v)oS~ 1 is the solution 
to the equation 

g + div 7 (g • c) = 

Proof. We know that 

J (pp(t,x) dv = J<pp(0,x)dv + J J ' {b,V<p)p(s,x) dv ds, for all t G [0,T]. 

Let us apply this identity to the function ip = i)(S), if) G C^°(R™). This is possi- 
ble, because V<p G L p {y) by Proposition 13 . 1 01 and one can approximate ip and its 
gradient in L p (v) by smooth cylindrical functions (see the proof of Lemma 0T2]). 
By the change of variables formula 

tpp(t,x) dv = J tfjg(t,x) dj, J <pp(0,x) dv — J ipg(Q,x) dj. 

Taking into account the chain rule Vp — (DS)*Wip(S) we immediately get for all 

te[o,T] 

(b,V(p)p(s,x) dvds = j J(DS-b,Vi/j(S))p(s,x)dvds= J J <c,W> g(s,x) d<y 

Hence g satisfies the desired integral relation and the proof is complete. □ 

Proposition 4.5. Assume that X — R°° and the following assumptions hold 

1) ^ G L m (y), for all m, i G N 

2) y— is locally integrable in any power for every n G N, where p Vn is the 
Lebesgue density of the projection v o P~ l = p Vn dx 

Assume, in addition, that that g solves the equation g + div 7 (g • c) = for some c 
satisfying sup tg j T ] \\g{t, ■) ■ Cj||£i+<;( T ) < oo for some e > and all i. 

Then the function p defined by the relation p ■ v — (g ■ 7) o T^ 1 is the solution to 
the equation 

p + div„(p • b) — 0. 

Proof. We apply the same arguments as in the proof of the previous proposition. 
We note that f Q f(b,Vtp)p(s,x) dv ds is well-defined for every ip G C, t G [0,T], 
because bi — ((DS)~ 1 c(S),ei), any function ((DS)~ 1 ei,ej) (depending on a finite 
number of variables) is locally integrable in any power by the previous proposition 
and 

sup \\ci(S)p(s,-)\\ L i+e (u) < 00 

0<s<T 

by the change of variables formula. 
The relation 

J f(c,Vip)g(s,x) dj ds = J J (b, Vcp)p(s,x) dv ds, t G [0, T], 
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with ip = <p(T), ip € C, can be easily justified with the help of Lemma 14.31 In- 
deed, since ip can be approximated in the corresponding Sobolev norm by smooth 
functions, the integral in the left hand side is well-defined and the chain rule is 
applicable. □ 



5. Existence 

In this section we prove the existence result by transferring a solution in the 
Gaussian case (whose existence was established in [7 ) with the help of a triangular 
mapping. 

Theorem 5.1. Assume that is is a probability measure on R°° such that: 
1) 

faeL m {is) for allm,ieN; 

2) there exists p > 1 such that 

h G L p (p) for all i G N; 

3) there exists e > such that 

exp(e(div„6)_) G L 1 ^); 

4) y— is locally integrable in any power for every n G N, where p Vn is the 

Lebesgue density of the projection v o P~ x = p Vn dx. 

Then for every po G L q (y), and q with q' > q > p* there exists to > depending 
on the above parameters such that the equation 

p + div„ (b ■ p) = 

has a solution on [0, to] satisfying p\t=o = Po o^d 

(24) sup ||p(i,-)IU« < oo- 

te[o,t ] 

Remark 5.2. One can easily see that assumptions 1), 4) together with Sobolev em- 
bedding imply that 1/ p Vn is Holder continuous. This may be sometimes restrictive 
for applications. We stress that we need 1) and 4) mainly for a-priori estimates 
on DT (see Lemma [4~3| . There are some possibilities to weaken these assumption. 
Some (weaker) sufficient conditions for T to be locally Sobolev one can find in [29] . 
This result is applicable if one has high integrability of po and bi. Some bounds 
on DT are available under the assumption that v is log-concave. They work even 
better if instead of triangular mapping one applies optimal transportation. See 
Theorem 18.41 and Example 18.51 below. 



Proof. Consider the triangular mapping T sending 7 to v. Let us show that c = 
DS(T) ■ b(T) satisfy all the assumptions of Lemma \2.10\ One has 

j=i j 

It follows immediately from the assumption of this theorem and Remark 13.141 that 
Cj G L p (v) for every i and p' < p. 

By Lemma 14.21 div 7 c o S = div„&. Consequently, the assumption 2) of Lemma 
12.101 is satisfied. Hence, there exists a solution to the equation g + div 7 (g • c) = 
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with g(Q,x) = p(0, T(x)). Proposition 14.51 now implies that p = g(S) is the desired 
solution. 

Property (f2"4"| is a slight extension of the corresponding statement of Lemma 
12.101 and can be easily checked. Hence the proof is complete. □ 

Example 5.3. Let us give an example of a probability measure on R°° with integrable 
logarithmic derivatives which is typical for applications and satisfies the assumption 
of the above theorem. We consider Gibbs measures on a lattice R z , which can be 
formally written in the following way 

(25) i/="exp(- Vk(x k )- W kA x k,Xj)) dx", 

where " dx" denotes infinite-dimensional Lebesgue measure on R z (which does not 
exist). The following existence result has been established in [T]. Assume that there 
exist a number N > 2 and a symmetric matrix J = {Jk,j} k j £ i Rd sucn that 

Wk,j(x k ,Xj) = Wj t k(xj,x k ) 

\W k j(x k ,Xj)\<J k j(l + \x k \ + \xj\) N 
\d Xh W k!j {x k , Xj )\ < J fcj (l + \x k \ + \xAf~ 1 
\V k (x k )\ < C(l + \x k \) L , \d Xk V k (x k )\ < C(l + l^i)^ 1 

d Xk V k (x k )-x k > A\x k \ N +°-B 

for some A, B, C, a > 0, L > 1. 

The matrix J is also assumed to be fastly decreasing (see [Tj for details), in 
particular the finite range case J k .j = if \j — k\ > No for some Nq is included. 
Then there exists a probability ("Gibbs") measure v on R z<i with exponentially 
integrable logarithmic derivatives 

k = d Xk V k (x k ) + d Xk (W kd + W jtk ) k G Z d . 
jei d 

It was shown in [1] that such v is a rigorous definition of v in (|25[) via the Dobrushin- 
Lanford-Ruelle equations. See [2] for uniqueness results. 

6. Uniqueness in the Gaussian case 

The following result was essentially established in [7]. We give below a slightly 
modified version with a sketch of the proof. 

Theorem 6.1. Assume that there exist p > 1, q > 1 such that \\c\\ G L p {-^), 
\\Dc\\ns G L«(j), div 7 cG L"(j). 

Then for every to > there exists at most one solution to 0) satisfying 

SUP \\p(t, OlUt-y) < OO, 
0<t<t 

where r > max(p* , q*) . 

If, in addition, d < oo and p > q, the assumption \\Dc\\-us G L q (-y) can be 
replaced by \\Dc\\ G L q loc (l)- 
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Sketch of the proof. We discuss only the case d < oo (the proof for d = oo is 
almost the same). Fix a non- negative C§°(M d )-function <p. Let p be a solution to 
@. Set: p = ip ■ p (this is important only for d < 00, since in this case we apply 
the local assumption). Take any ip € C^°(K. d ) such that -0 = 1 on supp(^) and set 
c = ijj ■ c. Note that p solves the following equation 

d 

It' 



-/5 + div 7 (c,p) = (V(p,c)p. 



Let us smoothen p with the Ornstein-Uhlenbeck semigroup: p e — e £ T £ {p). One 
has 

—p £ + div 7 (c • p £ ) = e~ £ r £ {p,c) + p £ div 7 (c) + e~ s T e {{V<p, c)p), 

where r £ (v,b) = e £ (b, V(T e (v)) ~T e (dW y (v -bj). 

The uniqueness proof relies on the concept of the so-called renormalizing solu- 
tions. Take a continuously differentiable globally Lipschitz function v and using 
smoothness of p £ compute jjjv(p £ ): 

j t <P e ) + div 7 (c • v(p £ )) = [v(p £ ) - p £ v'{p £ )} ■ div 7 (c) 

+ v'(p £ )(e- £ r £ (p,c)+p £ div 7 (c) + e- £ T e ((V^c)p)). 

According to estimate (68) from [7] there exists C = C(p,q) such that for r — 
max(p* , q* ) and small values of e one has 

||r £ || L i( 7 ) < C\\p\\ L , {l) {^\\c\\ LP(l) + ||div 7 c|| L5(7) + \\{Dcyy m \\ Lq{1) ). 

It follows from the assumptions of this theorem that the right-hand side is finite. 
In addition, r £ — > — div 7 (c) • p in L 1 ^) as e — > (Proposition 3.5 in [7])- 
Passing to the limit one obtains that 

foV(p) + div 7 (c • v(p)) = [v(pj - pv'(p)] ■ div 7 (c) + v'(p)({V<p, c)p) 

in the distributional sense (i.e., p is a renormalizing solution). Assume that there 
exists two different solutions pi , P2 in U (7) with the same initial condition. Ap- 
plying this relation to the difference p = p\ — p2 and v(t) = max(0,t), we get that 
for every ip £ C^°(K d ) one has 4(y • p+) + div 7 (c • (p)+<p) = (Vy, c)p + . Finally, 

^p+ +div 7 (c • (p)+) = 
in the distributional sense. Clearly, A J p+ o?7 = 0, hence J p + dj = and p = 0.U 

7. Examples of uniqueness 

In this section we study uniqueness problem for transport equations. As in 
Theorem 15.11 we reduce the proof to the Gaussian case (see Theorem 16. ip . 
Recall that 

c = DS(T)-b(T). 

Since the assumption on the divergence can be directly transferred, we need only 
to find some sufficient conditions for 

||c||, \\Dc\\ HS G L p ( 7 ). 

One can try to apply the trivial operator norm estimate 

||c||<||^(T)||||6(T)||. 
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Let us stress, however, that operator norm estimates do not seem to be available 
in the case of triangular mappings (unlike optimal transportation ones). In spite 
of this let us give another estimate of c which does not use operator norms. 

Lemma 7.1. For every 1 < p < 2 and q > 1 one has 

l oo P l 2 

lb,]™ du) 



|cf d 7 <C(p,q) [sup / f3f q dv 
Proof. Trivially we have 

oo 

\\c\\<J2\bi(T)\-\\d ei S(T)l 

Applying the inequality 



i=l 



i i 

which holds for every q > 1, we get for every 1 < p < 2 

oo oo 

\\4 p = (E • ii^cmi)* < (E i^ T )i f • ii^cnii* 



2=1 
OO 



= E IWI* • 16,(7)15 . \\d ei S(T)\\% ■ \\d £j S(T)\\ 
By the Holder inequality and Proposition 13. 101 



oo 



d 7 < E(/ l^(r)|^ -1^(7)1^^) 9 (/ ||5 e ^(T)||^.||9 e ^(T)||^d 7 



□ 



< E ( / im*"* 29 * • ( / h\ pq * 29 * • ( / u^r 29 • ( / iimii" 29 

< C(p,g)(sup J /3f d,/)' ■ [E(y N P9 * A/)' 5 ' 
7.1. Product case. 

Theorem 7.2. Assume that v is a product measure on K°° 

oo 

v = Y[e- w ' {x ' ) dx t . 

i=l 

Assume that for some e > 0, <5 > 0,1 < p < 2, q > 1 
1) 

, , M . (-l + S)e 

tUi\t) > — ■, — — — r for every i 

pq* (pq* + e) — £ 

and 

sup J (3f q ' +e dv<oo; 
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2) \\Db\\ HS G U\v) and 



\h\ vq dp) q + J2{ I \9*M pq d ») q < °°; 

3) div v (b) £ L?(i/). 

T/ien ||c||, ||Z?c||#s S ^(l)- ^ n particular, for every to > f/iere exists at most 
one solution to the equation satisfying 

sup IIp^OIIlp*^) < o°- 

0<t<t o 

Proof. First we check that the assumptions of Lemma 14.21 and Proposition 14.41 are 
satisfied. This is clear except for the estimate sup 0<t<t \\p ■ bi\\ L ( pg *)* < oo. To 
prove this we apply the Holder inequality \\p ■ bi\\ L ( pq *)* = \\p ■ bi\\ LPq /( Pq - q +i) < 

\\p\\LP*\\ b t\\LPi- 

By Theorem 16.11 and Proposition 14.41 the problem is now reduced to the unique- 
ness problem in the Gaussian case. 

Thus, it is sufficient to show that ||c|| , ||-Dc||#s € L p (^) for some p > 1. Since we 
deal with a product measure, the transportation mapping has a simple structure 

T = (T 1 (x 1 ),T 2 {x 2 ), ■ ■ ■ ,T n (x n ),- ■ ■). 

Hence 

c, = d Xi Si{Ti)bi{T). 

We apply Lemma [7.1 1 Note that that in this case d e .Si = 0. Taking this into 
account and following the proof of Lemma l7.1l we can get a more precise estimate: 



\crd 7 <C(p,q)J2( \bi\ P9 dv)~ q •( hf 

r l oo /> 

<C(p, q ) S np(j I3f dp) I 6 ' 



dv 



pq dvY , Kp<2. 



Let us estimate DC. Taking into account that Si and Ti are reciprocal, one 
easily gets 



and 



One can estimate || (c^Cj)!! in the same way as ||c|| and applying Proposition 
13. 151 one gets 

l oo 



\\{d Xi Ci)\\ p (h<C(p) / \\(d x V)\\P dz/ + sup 



dvY J2[ I^T dv 



<C(p,q)[ \\(d Xi b l )\\ P dp + Sup{\\^\\ LP ^ + , H ) J2( \ b i\ Pq dl> 
J i i=1 J 



Similarly 



|-B||^ S d-y < sup( 
i,3 



d x , Sj 
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dv 



£(/ \ d *M m dP 



where B itj = d x .Ci, i ^ j, B v 
It remains to estimate 



da:. Si 



d X] Sj 



PQ 



dv < 



[d Xi Si 

[dxA 



]pq +e 



]pq*+e 



dv 



dv 



PI* _ n _ 


pq* +£■ 




pq* 




pq* 4-e 


[/[ 



d Xj Sj 



pq* (e+pg* ) 



dv 



pq* (e+pg* ) 



d~j 



According to Proposition 13.71 and Proposition 13.101 we get that the right-hand side 
is finite if \Pi\tri*+^{u) < 00 an d tcj'^t) > pq }[ e ]^ p S q l e ) _ £ ■ The proof is complete. □ 

Corollary 7.3. Let v be as in Theorem \ 7. S\ satisfying tw'^t) > 0. Assume that 
1) 



sup 



{w'iit^e-^dt < 
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for every N > 1; 

2) for some 1 < p < 2 and q > 1 one /ias (d Xi bi) € L p (v); 

3) div^(6) e L p (z/) and 



E 



E(y ^J" <o °- 



T/ien ||c||, ||Dc||ijs € L p (-~{). In particular, for every to > i/iere exists at most 
one solution to the equation satisfying 

sup IIp(*,-)IIlp*(«/) < °°- 

0<t<t 

7.2. Gibbs measures. In this section we prove uniqueness for the measures de- 
scribed in Example 15.31 More generally, we will assume: 

Assumption (A): There exist smooth functions Vi(xi), Wij(xi, xj) such that 

oo 

(26) Pi = V-(xi) + ^2 9 Xi W it j 

3=1 

and there exists Nq > 1 such that Wij = if \i — j\ > Nq. 

Clearly, in this case the corresponding mapping S has a special structure 

S(xi,--- , x n , ■ ■ ■ ) = (Si(xi), S 2 {xi, x 2 ), ■ ■ ■ , S n {x n ^ No , ■ ■ ■ ,x n -i,x n ), ■■■)■ 

Example 7.4. Let us consider a Gibbs measure v =" e~~ dx" with Hamiltonian 

oo oo 

Under the assumptions of Remark 15.31 there exists a unique Gibbs measure v sat- 
isfying (|2"rJ|) . as explained above. 

We will also need the following 1-dimensional version of the Caffarelli contraction 
theorem (which holds true for optimal transport mappings in any dimension, see 

M, 1231). 
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Theorem 7.5. Let T : ffi — > Hi be the canonical increasing mapping pushing forward 
a probability measure e~ v ( x ) dx onto a probability measure e~ w ^ dx. Assume that 
V and W are twice continuously differentiable and V" < C, W" > K. Then T is 

Lipschitz satisfying T' < \J~% 

We recall that a probability measure \i on M. d is called log-concave if it has 
the form e~ v ■ "H \li where H is the fc-dimensional Haussdorff measure, k € 
{0, 1, • • • , d}, L is an affine subspace, and V is a convex function. We call a measure 
/i uniformly log-concave if ^e K ' x ' -fj, is a log-concave measure for some K > and a 
suitable renormalization factor Z. It is well-known (C. Borcll) that the projections 
and conditional measures of log-concave measures are log-concave. The same holds 
for uniformly log-concave measures. We can extend this notion to the infinite- 
dimensional case. Namely, we call a probability measure fiona locally convex space 
X log-concave (uniformly log-concave with K > 0) if its images /i o £ , I g X* , 
under all linear continuous functionals are all log-concave (uniformly log-concave 
with K > 0). We will also use the fact that the (one-dimensional) conditional 
measures of uniformly log-concave measures are uniformly log-concave with the 
same constant. 

Theorem 7.6. Assume that assumption (A) is satisfied and 

1) for every n > 1 

snp J (|ftp + HVftir) d/Koo; 

2) v is uniformly log-concave; 

3) for some 1 < pa < 2, qo > 1 

£(/ N P09 ° ^) 2 ™ <oo; 

■i— 1 

4) for some 1 < p < 2 

OO „ 2-p 

T/ien ||c||, ||Z?c||#,g € L q (-~{) for some q > 1. //, m addition, div„(6) e L q (v), then 
for every to there exists at most one solution to (0j satisfying sup 0<t<to \\p{t, < 
oo. 

Remark 7.7. a) According to results of [T| there exist probability measures satisfying 
the assumptions of the theorem (see Remark l5.3[) . In particular, 1) is automatically 
satisfied for these measures. 

b) We believe that the factor k? in 4) can be removed. This factor arise just 
because we deal with triangular transportations and the quantity VXfc is difficult 
to control (see Lemma 17.81 ) . One can get a better control applying optimal trans- 
portation mappings. Unfortunately, the existence of such a mapping in the infinite 
dimensional case for mutually singular measures is an open problem to the best of 
our knowledge. 

c) The assumption of uniformly log-concavity of the Gibbs measure v can be 
expressed in terms of the potentials Vi , Wij . Note that it is sufficient to require the 
uniformly log-concavity of the approximations. 
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Proof. It is easy to check that b, v, and p satisfy the assumptions of Lemma r4.2[ 
Lemma [4.31 and Proposition 14.41 Thus the problem is reduced to the uniqueness 
problem in the Gaussian case. We have to show that ||c||, ||Z?c||#s G L q (j) for 
some q > 1. The first part follows from Lemma l7.ll The second part follows from 
Lemmata 17. 8i [71)1 Indeed, note that 



(27) Dc = (DS ■ Db ■ (DS)- 1 ) oT+fS^bf (DS Xi )(DS)- 1 ) o T. 

i=l 

In the same way as in Lemma 17.11 and applying Lemma 17.91 we get the desired 
estimate for J £2°^ 1 bi(DS Xi )(DS)~ 1 . Note that g x „ = d Xi Ti(S) is bounded by 

as a one-dimensional optimal mapping of a Gaussian measure onto a uniformly 
log-concave measure by the Caffarclli theorem. Here we use the fact that the 
corresponding conditional measures are uniformly log-concave. 

We apply Lemma [7.81 to estimate DS ■ Db ■ (DS)^ 1 . To complete the proof we 
need to estimate J" | VT^ | 2 dj. Indeed, since v is uniformly log-concave, we can 
apply Remark [XH We get K J \d Xi T\ 2 dv < J x\ dy. Hence 



k k k 

J ||VT fc || 2 d 7 = J J2(d^T k f d 1 < J2j \\d Xi Tf dl<^i2j 



2 j ^ 

x i d T = K' 



□ 



Lemma 7.8. For every 1 < p < 2 there exists C, depending on P,Nq, and 
sup 4 / |/3j | dv such that 

p OO ^ 2 — p p 

J \\DS-Db-{DS)- x Y HS <c[Y,(j \dM& d") 8 (y livr fc || 2 d 7 



3,k=l 



Proof. We estimate ||£>5 • Db ■ (DS)- 1 ^ = Y,Zi \\ DS ' Db ' (DS)' 1 
For simplicity set M = DS ■ DB, L = (DS) -1 . Then 

/OO p „ OO OO 2 P 

(^TWMLe^y dv< \{T^\UA\\M-e k i) f dv 

»=1 J i=l k=l 

OO OO p „ oo oo 



(', 



2 



< / [E E \ L iA L i,i\\\M ■ e k \\\\M ■ ej \\Y dv< I Slii.Jk^l^^llAf-ejkl^llM-ejH* dv 

i—1 k,j — l k,j—l i—1 

OO p 2 — p p OO p 

< (/ ||M.e fc ||^||M. ej ||^ dv) 2 U Y\ L <M\Li,i\ drf 

k,j= 1 i—1 

OO p 2 — p p 2-p p OO p _ OO 

<Y[{ W-^dv) 4 (|||M.e,||^) 4 (/El^l 2 ^) (/E 

k.j — 1 ' i—1 i—1 

OO „ 2-p p oo 

= (E(/ll^ll*^) 4 (/ei^i 2 < 



Furthermore, 



oo oo n k p p 

E j \ua 2 dv = Ey i(^) life | 2 d 7 = Ey i^i 2 *y = y nvT fe n 2 d 7 



i — 1 2—1 2—1 
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and 

oo Nq 



||M-e fe ||* dv = (j2(£ d ^ S i-r- d ^ b i-rf)' P dv 

J i=l r =0 
„ oo JV 

oo N „ 1/2 f 

< C(p,N )Y,Y,{ \9 x A-r\^ dv) ( \d Xk bi. r \^ dv 

2 X ^ 2 oo 

<C(p,iV ) S up( J] U^S^dv) )Y;{ \ d *M 4 - 



j=i-N 3=1 



1/2 



oo . 

'E(/i^ 

3=1 J 



1/2 



In the last estimate we apply Proposition 13.141 and the special structure of S 
Finally, we obtain 

p OO ~ 2 — p « 

y ||£>S. J D6.(D5)- 1 ||^ s <c[ ^(y fod&du) 8 (y||VT fe || 2 rf 7 



2 



3",fe=l 

□ 

Lemma 7.9. Assume that for every p > 1 

su P y(iftr + iivAir + ^) **<c(p). 

T7ien supj J || (DS Xi )(DS) dv < D with D depending on p and No. 

Proof. Due to the special structure of S 1 the matrix -DS , Xi only has a finite number 
of non-zero entries. The result now follows immediately from Remark 13.141 and 
Propositions l3~T5l l3~To1 □ 

8. Finite-dimensional case and optimal transportation 

Instead of triangular mappings one can also apply the optimal transportation 
mappings. For a detailed account on optimal transportation see |13) . [30] . In this 
case the available a-priori estimates are essentially better in many respects. For 
instance, there exist L p -estimates on operator norms of DT which do not depend on 
dimension (see [25], [13]). Unfortunately, this approach has certain disadvantages: 

1) unlike the triangular mappings, the optimal transportation mappings do not 
have an explicit form and the a-priori estimates for them are usually hard to prove, 

2) the existence problem for optimal transportation mappings in infinite dimensions 
is solved in sufficient generality only for the case when the measures /i and v have 
a finite Kantorovich distance W^[}i, v) (see [21], [22])). If fx = 7 is Gaussian, this 
limitation means that basically we should restrict ourselves to measures which are 
absolutely continuous with respect to 7, i.e. v = g ■ 7 and, moreover, have finite 
entropy, that is/ g log g d 7 < 00 . 
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Remark 8.1. Some new existence results on optimal transportation of certain Gibbs 
measures are obtained in [35] . These results together with estimates from [35] , [T3] 
can be used to obtain infinite-dimensional uniqueness/existence statements for the 
case where v is uniformly log-concave. But we don't consider this approach in this 
paper. 

We assume in the rest of this section that d < oo. We consider the optimal 
transportation mapping T pushing forward the standard Gaussian measure 7 onto 
v = e~ w dx. In particular, T has the form T — V^E 1 , where ^ is a convex function. 
The inverse mapping S — T^ 1 is optimal too and has the form S = V$, where $ 
is the convex conjugate to 

The drifts c and b are related in the same way as above 

c = DS(T) ■ b(T) = L> 2 $(T) • b(T). 

Let us illustrate how our methods work in the finite-dimensional case. 

Theorem 8.2. (Uniqueness) Assume that W is locally Holder, \\Db\\ € L 2 oc {v), 
\b\ € L 4 (v), \VW\ G L A {v). Then ||c|| G L 2 ( 7 ), ||D 2 c|| £ tf oc (>y) and for every t 
and every fixed initial condition po £ L 2 (v) there exists at most one solution to (0) 
satisfying sup < t < to \\p(t, -)\\l^{u) < 00. 

Remark 8.3. 1) Note that we do not need any bounds on the second derivatives of 
W. 

2) The assumption of Holder continuity is made only to assure high enough local 
integrability (even boundedness) of ||Z) 2 <I>|| ■ ||(Z) 2 $) _1 ||. We believe that this can 
be achieved in some other (more efficient) way without Holder continuity. We note, 
however, that Sobolev estimates for optimal transportation in W,'^ with big p are 
hard to prove. See in this respect the recent paper |27j and the references therein. 

3) Some estimates applied in the proof are valid in the infinite-dimensional set- 
ting. 

Proof. First we note that the second partial derivative d ev $ of $ is locally Holder 
for any vectors e,v & R d , because both measures v and 7 have Lebesgue densities 
which are locally Holder. This follows from the well-known results of Caffarelli [17] 
(see |24] for technical improvements for unbounded domains). Clearly, the same 
holds for D 2 $. Let us apply Proposition 14.41 We need to show that ||c|| € L 2 (-f), 

l|£c||eL 2 oc ( 7 ). 

For ||c|| one has the trivial estimate 

||c||< \\D<S>(T)\\- \\b(T)\\, 

which implies 

2 J ' \\c\\ 2 dv < J \\D 2 <5>\\ i dv + J H&ll 4 dv. 

By Theorem 6.1 from [25] assumption ||VW|| € L 4 (v) implies that ||-D 2 $|| E L 4 (v). 
Hence ||c|| € L 2 {v). 
Let us estimate Dc: 

Dc = [L> 2 $ • Db ■ (D 2 ®)- 1 ] (V*) + [( J D 2 $) 6 ( J D 2 $)" 1 ] (V*), 

where for brevity we set 

d 

»=i 
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Using the standard HS-norm inequalities 

\\AB\\ HS < \\A\\ ■ \\B\\ HS , \\BA\\ HS < \\A\\ ■ \\B\\ HS , 
applied to an arbitrary A and symmetric B, we get 

\\Dc\\ HS < (^CllDbllns + C 1 / 2 ■\\(DH)-^\DH) b (DH)- 1 /^ Hs yV^), 

where C = \\D 2 ^\\ • ||(.D 2 $) -1 ||. 

Since ||_D 2 $||, ||(_D 2 $) _1 || are locally bounded functions, we only need to locally 
estimate 

||(£) 2 $)" 1 / 2 (D 2 $)6(Z) 2 $)" 1/2 || H s; 
To this end we apply the following inequality 
(28) 

J (D 2 W ■ b, b)r) dv > J \\D 2 <5> • b\\ 2 ri dv + J ((D 2 $) b ■ 6, (D 2 <£>) -1 V?7) dv 

+ 2 J Tr({D 2 $) b -Db-(D 2 <S>)- 1 )r ] dv + J \\ {D 2 ^ 1 ' 2 {D 2 ^) b (D 2 <^)- 1 / 2 \\ 2 HS i 1 dv, 



V;; 



P 

r\ dv < oo for a suffi- 



proved in Lemma 7.1 [25]. Here r) € C£°{R d ) with J 
ciently big p. 

Since we do not assume existence of D 2 W, we apply the integration by parts 
formula to get rid of this: 

J (D 2 W ■ 6, b)r) dv = J (VW, b) 2 r] dv - J (VW, 6)div„6 ■ r\ di/ - ^ (VW, Db ■ b)r] dv 
{VW,b){VW,Vr]) dv. 



Clearly, the right-hand side is finite by the Cauchy inequality. 
The elementary estimates 

(29) J ((D 2 <£) b ■ b, (DH)- 1 ^) dv < e J ^D 2 ®)- 1 / 2 ^ 2 ®)^ 2 ®)- 1 ' 2 ^ dv 

+ C(d,e) [ ||£|||6| 2 ^^ dv, 
J V 

and 
(30) 

2 J Tr({D 2 t>) b Db- {D 2 ®)- 1 )^ dv < e J \\{D 2 ^)- 1 ' 2 {D 2 ^) b {D 2 ^)- 1 ' 2 \\ 2 HS r ] dv 

+ C(e) J \\C\\\\Db\\% sV dv. 

imply that \\(D 2 ^/ 2 (DH) b (D 2 ^/ 2 \\ 2 HS e L 2 oc (v), hence ||Dc|| € L^M • □ 

The following theorem is formulated in a " dimension- free" manner. This means 
in particular that this formulation makes sense in the infinite-dimensional setting. 
Actually, we believe that an appropriate generalization of the theorem always holds 
in the infinite-dimensional case provided the corresponding optimal transportation 
map of v to 7 does exist. 
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Theorem 8.4. (Existence and uniqueness) Let is be a probability measure with 
Lebesgue density on M. d . Assume that v is a uniformly log-concave measure, so, in 
particular, there exists K > such that D 2 W > K Id in the interior of the support 
of v. Assume that W belongs to W 2 ' p (v) for some p > 1 and, moreover, 

d Xl W&L 2p (v), l<i<d, \\D 2 W\\ P G L l {v). 

In addition, we assume that 

J (D 2 Wb, b) dv < oo, b t G 0), 1 < i < d, 



and 



div v b G L 2 {y), ||£>6||# G L». 



Then 

1) under the additional assumption that e e ( dlv,yb )- G l}(y), there exists a solu- 
tion to 0) for any initial po G L 2+£ (v) satisfying sup 0<t<4[) \\p(t, - )\\l 2 (v) < 
oo; 

2) any two solutions to fl|) satisfying sup 0<t<t \\p(t, •)||x ) a( I/ ) < oo with the 
same initial condition po coincide. 

Proof. The proof follows the same line as the proof of the previous theorem. Ac- 
cording to the Caffarelli's theorem || (£> 2 <i>) _1 || < -7=. According to a result of [25] 

/ ||D 2 $|| 2p dv < J ||£> 2 W|| P dv < 00. This implies that L G L 2p {v). 

We use the estimates from the proof of the previous theorem. By Holder's 
inequality J \\C\\ ||-Dfr|j#5 dv < 00. Applying that J (D 2 Wb, b) dv < 00 we get from 
(|2"g]l and that 

J ||(D a *)- 1 / 2 (U 2 4) 6 (D a $)- 1 /a||^ s d l / < 00, J ||D 2 $ • b\\ 2 dv < oo, 

and, finally |jc|| 2 , H-DcH 2 ^ G L 1 ^). The uniqueness statement follows easily from 
Proposition 14.41 with the help of Holder's inequality. 

For proving existence, we cannot use Theorem 15 . 1 1 directly, because assumptions 
1) and 4) are not fulfilled in general. Note, however, that we need 1) and 4) only 
to apply Proposition 14.51 But the statement of Proposition 14.51 holds trivially in 
our case because of the uniform bound ||(I? 2 $) _1 || < (sec the proof of Lemma 



P|) . □ 
Example 8.5. Let 

d K 2 

v = p(x)dx, p(x)=CY[e *i 2 I{ Xi>0 }- 

i=l 

Note that v = e~ w is uniformly log-concave and ||VW||, ||.D 2 W|| belongs to L p (v) 
for any p > 0. Fix an arbitrary q > and set b = j^- It is easy to check that 
div„b is bounded from below and the other assumptions on b are satisfied. Hence 
for every po G L 2+£ (v) there exists a unique short-time (even long-time) solution 
to ©. 

Note that if q is big, then b is not a BV function with respect to Lebesgue 
measure. This makes inapplicable the finite-dimensional theory from [5]. 
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